A Multi-Dictionary Approach to Abstractness/Concreteness-Based Authorship Attribution
نویسندگان
چکیده

 We present some early results from a research project aimed at exploring the usefulness of abstractness/concreteness as stylistic features for authorship attribution. conjecture that authors use abstract/concrete words and phrases in suf- ficiently unique ways, so machine learning classifiers can learn to distinguish individual authors’ writing styles. Our approach is based on using abstractness rat- ings texts with established au- thorship generate training vectors different classifiers. The combined word/phrase ratings are extracted two separate dictionaries – an yields stronger than single ab- stractness dictionaries. paper describes details our methodology compares those obtained traditional attribution features. limitations current directions further outlined end paper.
منابع مشابه
A Web-Based Self-training Approach for Authorship Attribution
As any other text categorization task, authorship attribution requires a large number of training examples. These examples, which are easily obtained for most of the tasks, are particularly difficult to obtain for this case. Based on this fact, in this paper we investigate the possibility of using Webbased text mining methods for the identification of the author of a given poem. In particular, ...
متن کاملThe Computational-Linguistic Approach to Forensic Authorship Attribution
This article examines the diversity of methods in authorship attribution through a lens which focuses attention on a single common element. The current state of authorship attribution study is spread throughout so many academic and non -academic disciplines that it is nigh impossible to describe all of the various assumptions about language and authorship. The disciplines involved in authorship...
متن کاملAuthorship Attribution
Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. Recent work in “non-traditional” authorship attribution demonstrates the practicality of automatically analyzing documents based on authorial style, but the state of the art is confusing. An...
متن کاملOBA2: An Onion approach to Binary code Authorship Attribution
A critical aspect of malware forensics is authorship analysis. The successful outcome of such analysis is usually determined by the reverse engineer’s skills and by the volume and complexity of the code under analysis. To assist reverse engineers in such a tedious and error-prone task, it is desirable to develop reliable and automated tools for supporting the practice of malware authorship attr...
متن کاملAn Off-the-shelf Approach to Authorship Attribution
Authorship detection is a challenging task due to many design choices the user has to decide on. The performance highly depends on the right set of features, the amount of data, in-sample vs. out-of-sample settings, and profilevs. instance-based approaches. So far, the variety of combinations renders off-the-shelf methods for authorship detection inappropriate. We propose a novel and generally ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... International Florida Artificial Intelligence Research Society Conference
سال: 2023
ISSN: ['2334-0762', '2334-0754']
DOI: https://doi.org/10.32473/flairs.36.133262